A new local covariance matrix estimation for the classification of gene expression profiles in high dimensional RNA-Seq data
نویسندگان
چکیده
Recent developments in the next-generation sequencing based on RNA-sequencing (RNA-Seq) allow researchers to measure expression levels of thousands genes for multiple samples simultaneously. In order analyze these kinds data sets, many classification models have been proposed literature. Most existing classifiers assume that are independent; however, this is not a realistic approach real RNA-Seq problems. For reason, some other methods, which incorporates dependence structure between into model, proposed. Quantile transformed Quadratic Discriminant Analysis (qtQDA) recently one those classifiers, estimates covariance matrix by Maximum Likelihood Estimator. However, MLE may reflect genes. we propose new local function estimate be used qtQDA model. This assumes dependencies locally defined rather than complete dependency. The performances classifier two different compared over terms error rates. results show using yields better and increases performance classifier.
منابع مشابه
The challenge of getting a high quality of RNA from oocyte for gene expression study
The extraction of intact RNA from oocyte is quite challenging and time-consuming. A standard protocol using commercial RNA extraction kit, yields a low quantity of RNA in oocytes. In the past, several attempts in getting RNA for gene expression study ended up with a few different modified methods. Extraction of high-quality RNA from oocyte is important before further downstream analyses such as...
متن کاملA new test for sphericity of the covariance matrix for high dimensional data
AMS subject classifications: 62H10 62H15 Keywords: Covariance matrix Hypothesis testing High-dimensional data analysis a b s t r a c t In this paper we propose a new test procedure for sphericity of the covariance matrix when the dimensionality, p, exceeds that of the sample size, N = n + 1. Under the assumptions that (A) 0 < trΣ the concentration, a new statistic is developed utilizing the rat...
متن کاملEstimation of the Covariance Matrix of Large Dimensional Data
This paper deals with the problem of estimating the covariance matrix of a series of independent multivariate observations, in the case where the dimension of each observation is of the same order as the number of observations. Although such a regime is of interest for many current statistical signal processing and wireless communication issues, traditional methods fail to produce consistent es...
متن کاملthe clustering and classification data mining techniques in insurance fraud detection:the case of iranian car insurance
با توجه به گسترش روز افزون تقلب در حوزه بیمه به خصوص در بخش بیمه اتومبیل و تبعات منفی آن برای شرکت های بیمه، به کارگیری روش های مناسب و کارآمد به منظور شناسایی و کشف تقلب در این حوزه امری ضروری است. درک الگوی موجود در داده های مربوط به مطالبات گزارش شده گذشته می تواند در کشف واقعی یا غیرواقعی بودن ادعای خسارت، مفید باشد. یکی از متداول ترین و پرکاربردترین راه های کشف الگوی داده ها استفاده از ر...
data mining rules and classification methods in insurance: the case of collision insurance
assigning premium to the insurance contract in iran mostly has based on some old rules have been authorized by government, in such a situation predicting premium by analyzing database and it’s characteristics will be definitely such a big mistake. therefore the most beneficial information one can gathered from these data is the amount of loss happens during one contract to predicting insurance ...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Expert Systems With Applications
سال: 2021
ISSN: ['1873-6793', '0957-4174']
DOI: https://doi.org/10.1016/j.eswa.2020.114200